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The average node-to-node distance of scale-free graphs depends logarithmically on A'', the number 
of nodes, while the probability distribution function (pdf) of the distances may take various forms. 
Here we analyze these by considering mean-field arguments and by mapping the m — 1 case of the 
Barabasi- Albert model into a tree with a depth-dependent branching ratio. This shows the origins 
of the average distance scaling and allows a demonstration of why the distribution approaches a 
Gaussian in the limit of N large. The load, the number of shortest distance paths passing through 
any node, is discussed in the tree presentation. 
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I. INTRODUCTION 

Recently, many examples have been found of systems 
whose innate topology is not homogenous and can rather 
be described in terms of a scale-free, random structure. 
Examples range from the Internet to cellular metabolism 
networks. The interest of the physics community in this 
field stems from the fact that the behavior of many sys- 
tems on such networks or graphs changes drastically and 
often attains characteristics close-to but not quite like 
the mean- field limit. 

A scale-free graph consists of a set of nodes or vertices 
V and bonds or edges E connecting the vertices to a 
structure. The essential measure of the scales or lack 
thereof is the connectivity or degree distribution of the 
nodes V: the probability of any node to have k edges 
(one may distinguish between directed and undirected 
graphs; in the former case the in-going and out-going 
pdf's can differ). If this probability Pk follows a power- 
law behavior, a structure arises that does not have any 
intrinsic scale. The Internet is an example of such a Pfc ~ 
k^'' , and several models have been designed that fit the 
same description. Later on enhanced models have been 
devised to capture the characteristics of more elaborate 
phenomena, like the tendency of clustering [|l], ||, ||, ^. 

The models lead to evolving graphs which grow con- 
tinuously in time by the addition of new nodes, with only 
a limited number of notable exceptions where the scale- 
free graph is generated by means of a Monte-Carlo algo- 
rithm |5[|. The degree distribution Pk and average con- 
nectivity become stationary in the thermodynamic limit, 
save for the tail of the distribution which is subject to 
finite-size cutoff effects |^ . A practically minded ques- 
tion in the same spirit is the growth mechanism of the 
Internet 

It is a common feature of growing networks that 
they spontaneously develop degree-degree correlations 
between adjacent nodes This is a manifesta- 

tion of the preferential attachment [ pd]| principle, where 
more connected nodes are to attract a larger proportion 
of new links as the network grows. One recent study hints 



that the correlation between neighboring node connectiv- 
ities is the mechanism behind the logarithmic scaling of 
the network diameter or the average shortest distance 
between two randomly chosen vertices, with respect to 
the system size fl^ . The support for the argument is 
empirical evidence from simulation results of a broader 
class of scale-free graph ensembles, where a power law 
growth of the diameter has been indeed identified. The 
question of viability of logarithmic scaling in real-world 
networks is particularly essential, since it has an impact 
on efficiency and percolation issues (communication over 
the Internet, spreading phenomena |l3| , [l^ , community 
structures 15, [l6|). 

Until recently, less attention has been paid to the prob- 
ability distribution of shortest path lengths or sometimes 
referred to as chemical distances in scale- free graphs, pos- 
sibly owing to the fact that it has been implicitly assumed 
that the average diameter is an adequate measure of dis- 
tance properties in the networks. The particular form of 
the distribution function may have bearings on the 
formance of search algorithms in scale-free graphs 
On the other hand, the distribution of shortest paths has 
been analytically calculated for the small-world model, 
employing the underlying lattice structure ||l8| and arriv- 
ing at a Gaussian-like distribution for large system sizes. 
Likewise, a model for deterministic scale-free graphs has 
been proposed and analyzed lately Q , where a Gaussian 
is again obtained in the asymptotic limit. 

In this paper we focus on a subset of scale-free graphs 
described by the Barabasi- Albert model 1 1 1 that in ad- 
dition are loopless rooted trees from a topological point of 
view, i.e. the m — 1 case where one connects new nodes 
by only one link to the existing structure. By remov- 
ing the redundancy of interconnecting loops it is possible 
to consider the distance properties on a mean-field level, 
and also to analyze "load" or "betweenness" j20[ 
the number of any shortest paths passing through ver- 
tices. The essential fact here is that the hub of the tree, 
e.g. the node with the highest connectivity for simplicity, 
transmits connections between all the branches emanat- 
ing from it. We show that a stochastic branching process 
rooted in the preferential attachment rule gives rise to 



2 



the logarithmic scaling of the diameter and that the pdf 
of the minimal paths approaches a Gaussian. 



Since throughout this text we are interested in tree 
structures, it is useful to overview their basic features. 
In the context of random networks, one often resorts to 
Ref. and the derivation therein, which suggests that 
the diameter of graphs grows logarithmically. Although 
the calculations there are performed for random graphs 
containing loops, the result obtained closely resembles 
that for balanced Cayley trees with uniform coordination 
numbers (except for the coordination number of the cen- 
tral node, which is different). According to this, the num- 
ber of nodes separated from node zero by k nonrecurring 
steps goes as z'^ where z is the coordination number for 
the Cayley tree. It then follows simply from the sum 
of a geometric series that both the longest distance be- 
tween nodes and the average distance should behave as 
In TV. 



It is obvious that trees have unique shortest paths be- 
tween any two nodes in the sense that without traversing 
the same edge twice it is not possible to find an alternate 
minimal route (unlike in unweighted graphs with loops, 
where there is usually more than one minimal path) . We 
can then define one of the nodes as the root of the tree and 
unambiguously arrange all the other nodes into layers de- 
pending on their minimal distance to the root. Finding 
the shortest path between two chosen nodes is nothing 
but identifying the deepest common node along the paths 
leading from the root to the source and target vertices 
and then connecting the two nodes via this common fork. 
Notice that the choice of the root here is slightly arbi- 
trary; one would prefer to use balanced trees. 



We study scale- free Barabasi- Albert trees start- 
ing with a single vertex. In each time step then we add a 
new vertex with only one outgoing edge. The other end of 
the edge is connected to one of the nodes already present 
in the system with a connection probability proportional 
to the connectivity or degree of a particular node. All 
edges are thought of as bidirectional and having the same 
weight, namely 1. As a slight modification to the original 
model, the connection symmetry of the first two nodes is 
broken by introducing a "virtual" edge to the very first 
vertex which only gives preference for this node over the 
second one when it comes to the subsequent addition of 
further nodes. This way, we can automatically identify 
the most connected node in the network and call it its 
root. To have a balanced tree one needs every subtree of 
the root to have the same number of nodes in the con- 
figurational average. This is only attained when the root 
is the most connected node, since the BA model ensures 
that the order of the nodes in terms of connectivity does 
not change in the course of addition of new nodes and is 
fully determined by the time of introduction of a node. 



II. MEAN-FIELD APPROACH 

To begin with, we will investigate the shortest path 
distribution in a mean-field model of a tree network, be- 
tween the root of the tree and all the other nodes. This 
argument extends also to general graphs in the case that 
the new nodes added (e.g. m > 1 Barabasi-Albert net- 
works) do not cause a significant amount of shortcuts 
between already existing nodes. 

Let us consider a uniform branching process for each of 
the layers in the tree so that every node on a certain layer 
has the same number of offsprings to produce the next 
layer beneath; it shall amount to b{l) for layer I for short. 
This way the original stochastic model is approximated 
by a deterministic graph | [l9| , p3| . The number of nodes 
n{l) with a separation I from the root is then n(l) = 
n{l - 1) b{l - 1) with the condition that n(0) = 1. The 
actual form of b{l) can be obtained by making use of the 
preferential attachment rule for BA networks. According 
to this, the probability that a newly introduced node will 
connect to any given set of nodes is proportional to the 
cumulative connectivity of the set in question. Thus, 
the number of nodes on layer I + 1 changes according to 
the following rate equation, due to the addition of a new 
node: 



dn(l + 1) 1 , , 

7^ = (b(l) + l)n(l), 



(1) 



since the right hand side describes the attachment prob- 
ability to layer I, where N is the number of nodes in 
the system and 2N is the normalization factor for the 
connectivity. Writing n{l -I- 1) = n{l) b{l), expanding the 
derivation, and dividing by n{l) b{l) give 
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b{l) dN 2N \b{l) b{l-l) 



(2) 



If we substitute B{N, I) — l/b{l) by expHcitly indicat- 
ing the size dependence on N and assume that B{N, I) 
is a slowly varying function. 



dliiB{N,l) 1 dB{NJ) 



ON 



2N dl 



(3) 



It is straightforward to expect a solution in the decom- 
posed form of B{N,l) = Bi{l) / Bn{N): 



a In Bat (iV) 
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dBi{l) 



2N Bn{N) dl 



(4) 



and since the left hand side is a function of only N , 
Biil) = 2al with a constant a. Finally, we get Bfq{N) — 
a\nN and 



b{l) 



1 \nN 

2~r' 



(5) 



This relation does not apply to the root [l = 0) for 
obvious reasons. Eq. (^ also implies that the number of 
nodes with a given distance to the root n{l) keeps growing 
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with I until b{l) drops below 1 and then starts to decrease, 
as the bottom of the tree is approached {n 1). This is 
in strong contrast to the formal prediction of a constant 
branching for any random graph , which would result 
in a monotonous exponential growth, as would be the 
case in usual Cayley trees. 



III. DISTANCE SCALING 

Using the recursion relation n{l) — n{l — 1) b{l — 1) 
for the number of nodes on a given level, we can give an 
estimate for the shortest path distribution function with 
the source of the paths at the root of the tree. Instead 
of Eq. (|^) we take now the more general form of b{l) = 
{A/l)^ and approximate the sum with an integral in the 
following expression, which reads 



i-i 



i{l) = n(0) = 6(0) exp 

1=0 

« 6(0)^^('"i' exp ^-A^' 



i-i 



ln^6« 



i=i 



\nxdx 



5(0) / Ae 



(6) 



The result above for n{l) approaches a non-normalized 
Gaussian in the large- limit as ^ '--^ In iV goes to infinity, 
which can be seen from Fig. 1 where a C corresponding 
to a very large network has been used. In order to draw 
further conclusions, we will determine the parameters of 
the Gaussian that give a best fit to n{l). For the sake of 
simplicity, let us now consider the function of the form 
f{x) = {C/x)^-: 



i?exp 



{x-^^y 



2a^ 



(7) 



We first match the extremal point of / to the mean of 
the Gaussian, resulting in /i = C'/e. The maximum value 
is thus R = exp(CA/e); the standard deviation a can be 
obtained by the requirement that the derivative functions 
of / and the Gaussian be t he same in the vicinity of /i up 
to first order, giving a — ^ C j (Ae). Using the parameters 
acquired this way, we can find a very good approximation 
to /(x), which is almost identical to that of a least-square 
fit. 

Furthermore, additional information can be gained if 
we look into the normalization conditions for n{l). Triv- 
ially, the sum of n(V) over all layers should return the 
total number of nodes in the system, N . Again, we ap- 
proximate the sum with an integral: 



1=1 

6(0) 



n[x)ax = — — 
1 e Jo 



X 



dx 



e^^ exp 



(x - Af 
2A/\ 



dx 




FIG. 1: A Gaussian fit for the function f{x) = {C jxY with 
C = 300 e. A few points of the Gaussian are represented 
by the dots. The difference is only noticeable at the tails of 
the functions. The inset shows how the quadratic error of the 
two functions (normalized for area) appears to be a decreasing 
power law with increasing C . 



2ttA 



A 



(8) 



where we assumed that A is large enough so that we 
can neglect the correction of the error function to the 
Gaussian integral. We should also be aware that I has a 
finite cutoff because of the bounded depth of the tree — 
yet, the quickly vanishing n{l) makes it possible to take 
I to infinity. Finally, /(a;) ^ 1 as x — > so that the 
integrand is bounded everywhere. 

Recall now that the degree of a node in BA networks 
grows with the power of N, 6(0) ^ Apart from 

6(0), the only term on the right hand side of Eq. (^ that 
may contribute to the overall linear growth in N is e^"^, 
which increases much faster than ^/A^ so the latter can 
be taken a constant. The consistency condition with the 
left hand side requires that e^^ ~ N'^~^ should hold, and 
thus 



A = ^ ^ In N + const. 
A 



(9) 



Disregarding the constant, we end up with a very sim- 
ilar but more general expression than that of Eq. (g|) for 
6(0: 



6(/) 



TniV 
"1 



with l3 + vX = l. 



(10) 



This implies that if a scale-free tree is characterized by 
a branching process decaying as a power law as a func- 
tion of the distance from a suitable root node with the 
highest connectivity, the relation (|l^) should necessar- 
ily be satisfied. Not surprisingly, it is true in the case 
of BA trees, where /3 = 1/2 and according to Eq. (||), 
V = 1/2 and A = 1. One should note that in the process 
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of constructing the mapping we rely on the fact that the 
number of nodes in a layer depends only on the average 
branching ratio b{l); the fluctuations in the degrees of 
nodes is omitted. For this reason the degree distribution 
exponent —3 is not present in the tree representation. 

The node-to-node distances in the mean-field model 
are calculated as follows. We traverse each node of the 
tree and enumerate the routes with certain lengths that 
start at or go through this node and have both of their 
ends in the subtree of the node. Practically speaking, we 
can think of this node as the root for its subtree and per- 
form the same calculations as we would do for the 'global 
root' of the tree. If we by n*^ denote the number of 
possible paths going out to the subtree of a node on level 
s that have length / and one end fixed at the node on 
level s, 



i-i 



(11) 



Let now r(*)(Z) be the number of all routes that go 
through or end at a particular node on level s and have 
a length of I, 



i-i 



(0 + e(6(s)-i)^ 



b{s)\n^'^{i)n(''^l-i) 



t V 2 ; b{s) bis) 



The second term in the sum has contribution to r*-^-* (l) 
only when there are branches left going out from a node, 
in average when b(s) > 1. The number of paths with a 
specific length in the whole system is therefore 



(13) 



the number of 



where n{s) is defined as earlier in Eq. 
nodes on a given level s. 

The Barabasi- Albert model allows for more rigorous 
derivations of the relation for n(l). The mathematics 
community often refers to the tree interpretation of the 
model as recursive trees, and thus exact results have been 
obtained for both the distance distribution and the diam- 
eter of the trees |2^, ^ . Bollobas and Riordan give 
a gener al p roof for the diameter scaling of scale-free BA 
graphs • The mapping to Cayley trees also resembles 
the work of Krapivsky and Redner, who arrive at a closed 
recursive analytical form for n(l), in a more general con- 
text than that of scale- free trees It also resembles 
Cayley models of Internet traceroutes by Caldarelli 
and co-workers. 




FIG. 2: The average number of branches per node normalized 
with the logarithm of the system size, represented versus the 
minimal distance of the nodes from the root with maximum 
connectivity for the BA model. A power-law fit has been 
performed in a window indicated by the heavy line, giving 
b{l)/\nN ~ 0.43 r°®^^^ The exponent is very close to -1. 
The inset shows b(l) plotted against the normalized minimal 
distance. The systems range from 10'^ to 10® nodes in size 
with logarithmic increments. The number of iterations for 
the systems go from 10^ to 100, depending on the size. A*'. 



IV. COMPARISON WITH SIMULATIONS 

Numerical simulations of BA scale-free trees fully con- 
firm the inferences drawn in the preceding section. Most 
important of all, the average number of branches per 
node on a given level is shown in Fig. |^. The numer- 
ical parameters of the power-law fit conform with the 
mean-field values: the exponent of the decay is almost 
exactly —1, and the prefactor of the logarithm with 0.43 
is also close to that of the predicted 1/2. It is also worth 
to note that if we rescale the distance variable by the log- 
arithm of the system size, we can attain a data collapse 
with a very good accuracy. This means that for BA trees 
in practice b{l) can be approximated as 



6(0 



iil<L{N) 
Ki otherwise 



(14) 



From the inset of Fig. ^ it is also apparent that the 
cutoff L{N) is a little over the value of In A^, by a factor 
of about 1.3. On the other hand, the drop of b{l) at L{N) 
is measured to be either an exponential or a power-law 
with a very large exponent. The mean- field prediction for 
the maximum of the shortest path length, L{N), can be 
obtained by equating n{L) = 1 in Eq. (^ and using the 
Gaussian approximation of Eq. (^. The solution up to 

first order in In A^ is that L{N) « InN, which again 
is in reasonable agreement with the mean- field argument. 

The derived quantities n{l)/N and the node-to-node 
distance distribution is shown in Fig. |^ for two distinct 
cases: 
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FIG. 3: Root-to-node (left) and node-to-node (right) distance 
distributions with circles (BA model) and their predicted val- 
ues with squares. The prediction is based on Eqs. (pT[)-(p^. 
Trees of 10® nodes are measured and averaged over 100 re- 
alizations. The dashed lines show the least-square fits with 
the function [C/ {I — 1)] to the measured data points. The 
constant for root-to-node distances is Cr ~ 15.7 and for node- 
to-node distances C„ = 33.2. Cr is in a very good correspon- 
dence with the analytical value of Cr = 0.43 e InA*' = 16.2 
and C„ ~ 2Cr- 



• the root-to-node and node-to-node shortest path 
distribution is measured in an ensemble of random 
BA trees using simulations. Instead of every possi- 
ble pair, the node-to-node distances are measured 
only between a large but finite number of randomly 
selected vertex pairs, for practical reasons; 

• both distribution functions are estimated by utiliz- 
ing the mean-field tree mapping, using the asymp- 
totic form of Eq. ( p^ for b{l) with a cutoff at 
L = 1.21niV. 

It is to be seen that a very good correspondence is 
found between the root-node distribution functions, but 
the overall two-point pdf's are sensibly close as well. 

While it has been relatively easy to derive analytical 
results for the root-node distances in the mean- field trees, 
Eq. (|l^) and the quantities it is constructed of turn out 
to be too complex to handle without numerics. The for- 
mulas (pT|)-(p^) above are used to calculate the approxi- 
mate values of the node-to-node path length distribution 
in the mean-field trees using the expression of Eq. (|l^ ) 
instead of the analytical form of Eq. (j|), so as to better 
represent the random BA trees. It is reassuring that the 
generic form of the nodc-to-nodc distance pdf also follows 
a {C /xY function, only with a different C constant from 
that of C for the root-node distances [Eq. (||)-(0)]; see 
Fig. ||. The diameter of the trees relative to the logarithm 
of the system size can be seen on Fig. ^. [1) « IniV or 
in other words twice the mean of root-to-node distances. 
This is somewhat expected as the main contribution to 



FIG. 4: The diameter and mean depth of networks of different 
sizes, divided by the logarithm of their size. Circles represent 
the mean of node-to-node distances, while squares the mean 
root-to-node distances. Both are apparently proportional to 
In A*', and the prefactors of ^ 0.5 and ~ 1 are in very good 
agreement with their respective analytical values. 



the node-to-node paths arises from passing through the 
root, for large graphs. It leads to a convolution- type dis- 
tribution (from the two 'legs'). It can easily be seen that 
the diameter cannot exceed twice the depth of the tree, 
which gives rise to a logarithmic growth in any case. 



V. ON THE "LOAD" ON TREES 

On a hierarchical structure the total number of min- 
imum paths going through a node (the "load") can be 
divided into two contributions. First, those paths that 
connect nodes in separate sub-branches of the node to 
each other, and, second, those that connect the nodes 
belonging to the branches to the rest of the tree. Call 
d{l) the number of the descendants of a node on level 
I. In other words, d(l) is the size of the subtree for the 
node. Then the load can be written simply as 



A(0 



+ d{l) X (iV-d(O), 



(15) 



where the last term counts the connections towards the 
hub. 

For the particular example we are concerned with, it 
is easy to see that the latter term dominates (TV :s> d{l)) 
and moreover that a good approximation is given by just 
simply A(/) « d{l) N. Thus one may investigate the de- 
pendence of the load on the level (or depth) of the tree, 
I. For d{l) in the mean-field picture one has that 
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FIG. 5: Load distribution for mean-field trees modeling BA 
networks. The probability P is proportional to n, the number 
of nodes on the levels of the tree. The load on the root has not 
been showed since it does not average. The inset shows the 
load distribution for a usual Cayley tree with a coordination 
number z = 2. The bold lines indicate power law fits, which 
give exponents of —0.99 and —1 for the BA and the Cayley 
trees, respectively. The mean-field tree is a mapping of a 
random BA tree with 10^^ nodes. 



and for the layer immediately below 

=i+2"W _ dil)n{l) - n{l + I) 



d{l + 1) 
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where we also used the recursion relation for n{l + 1). 
Finally, the load changes for the layer underneath as 



(18) 



Since the load A(^) is the same for each of the nodes on 
a particular level /, the distance-load distribution is di- 
rectly given by the normalized n-A function, thus hiding 
the implicit dependence on I. Considering that 



Ail + l) = 



m 

n{l + l) = n{l)b{l), 
A{1 + 1) n{l -I- 1) = A(Z) n{l) — const, and therefore 

const. 
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(19) 
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We then expect to see that the load is inversely pro- 
portional to the number of nodes on the levels, which 
is indeed the case according to Fig. |[ The same result 
holds for normal Cayley trees from Eq. p5[). 

Note that we have to use mean-field trees which would 
correspond to random networks with a large number of 



FIG. 6: Betweenness as a function of the descendants for every 
node in the network. Trees of size 10* are taken with 100 
realizations. The root which descendants are defined down 
from is always the initial node. The prediction of Eq. ( pi] ) is 
represented by the solid line. 



nodes so that the number of levels is of the order of 10. 
For the load distribution we consider only levels for which 
b{l) > 1 because otherwise subtrees do not exist in the 
average sense. It is surprising that the load distribution 
exponent does not depend on the actual form of b{l), 
being universally —1 [Eq. (^|)]. Indeed, the exponent of 
the distance-load pdf is independent of the choice of the 
node that all the distances are taken relative to. 

Another common way of defining the importance of the 
nodes in terms of shortest paths passing through them is 
the one called betweenness, favorable for its algorithmic 
feasibility and simplicity. Newman presents a breadth 
first search algorithm for efficient calculation of the be- 
tweenness of nodes on random graphs The only 
notable difference to Eq. (|l5|) comes from the fact that 
the betweenness also accounts for paths that originate 
from the nodes themselves, which nevertheless amounts 
only to a constant system size. 

We will calculate the betweenness on the trees, now 
focusing as a goal on the probability distribution of the 
load. An estimation can be given for a node by consider- 
ing the contributions to it, and by separating the network 
to a descendants part with d nodes in the branches and 
all the rest with N — 1 — d nodes. The node being the 
source, we have N shortest paths to any other node; if the 
source is among the descendants, we have d{N—l — d+l) 
ones going through; if the source is any other node from 
the network, we have (iV — 1 — d){d + 1). A fourth con- 
tribution, coming from paths passing through the node 
but having both ends in the descendant tree, has been 
neglected. They add up to an estimated betweenness K 
of 



K{d) w 2iV - 1 + 2{N - l)d - 2d^. 



(21) 
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FIG. 7: Betweenness pdf for a system of 10* nodes. The 
exponent of the power law shown is —1.99. The inset displays 
the logarithmically binned pdf of the descendants for systems 
with lO'' nodes. Its power law exponent is —1.99 as well. 



Here it is to be seen that for small d's the linear term 
dominates, just as in our previous load calculation; Fig. ^ 
justifies our estimations. 

The betweenness probability distribution P{K) taken 
over all nodes in the network can then be concluded to 
asymptotically follow a power law decay with a universal 
exponent of —2. This is since K is linear in the num- 
ber of descendants d and moreover that the pdf of d 
scales universally with an exponent of —2 for supercrit- 
ical trees [Q. Strictly speaking, the conclusions here 
are only true for the supercritical part of the tree, i.e. 
where h{l) > 1. The subcritical leaves of the tree have 
an increasingly smaller number of descendants, though, 
which drop exponentially with each new layer, and it 
can be verified that the descendant pdf decay exponent 
is indeed above 2 if only this part of the tree is consid- 
ered. Nevertheless, Fig. |^ shows that both the descen- 
dant pdf and the load pdf are accurately described by 
inverse square functions. A scaling of the load distribu- 
tion has been experimentally found on other scale-free 



networks as well [gO 
versal exponent of a 



only with a slightly different uni- 
bout 2.2. 

A further, practically more far-reaching observation is 
that the average betweenness as measured as a function 
of the locally known node degree grows as a power-law 
of the degree with an exponent of about 1.8 (Fig. ||). 
A mean-field approach can be used to estimate for the 
exponent, though, if we consider that the preferential 
attachment principle for large degrees gives rise to a 
descendant-degree scaling of d ~ fc^ {P = 1/2), which is 
the inverted relation for the time evolution of the degree 
of a parent node In this particular case, time is mea- 
sured as the size of the node's subtree. A substitution of 
the latter into the linear load equation would suggest an 
exponent of 2; the deviation from it may come from the 
rather restricted range of the degree that the relatively 



FIG. 8: Average betweenness for nodes with a particular de- 
gree, taken over 100 realizations of networks with 10* nodes. 
The fit of a power law indicates an exponent of about 1.78. 
The inset shows the average number of descendants versus 
the degree for systems with lO*" nodes, for which a power-law 
fit gives d{k) ~ 



small system sizes allow. 



VI. CONCLUSIONS 

In this paper we have mapped scale-free Barabasi- 
Albert trees to a deterministic model of a rooted tree 
with a uniform branching process on each layer of the 
tree. This idea resembles work on the Internet struc- 
ture 1^ and on the structure of branched cracks, where 
an inverse relation of the branching to distance has been 
observed pl| . 

Simulations show that the distribution of the number 
of branches on one particular layer of the tree follows a 
power-law function, but it turns out to be a good ap- 
proximation to describe the branching only by its mean, 
b{l). In the simple case of BA trees it can be shown by 
means of this mapping that the diameter of the networks 
is bounded by the logarithm of the network size and the 
asymptotic form of the distance distribution functions 
follows immediately. In other words, we can examine the 
slow convergence of this function to the limiting Gaussian 
form for infinite system sizes. Given an effective descrip- 
tion in terms of a tree plus a branching process, further 
information can be found, e.g. one may consider the scal- 
ing of the number of shortest-distance paths (load, or be- 
tweenness). Non- uniform critical trees could perhaps be 
constructed in a self-organized fashion, as is possible for 
the statistically uniform case js^. 

One should note the close relation of the Cayley repre- 
sentation to minimal spanning trees (MST) on scale-free 
(random) networks; for the m — 1-networks these two co- 
incide. This makes it an interesting prospect to study the 
load and distance properties of MST's in other scale- free 
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networks. 
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